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Abstract — Renewable energy sources can be attached to sensor 
nodes to provide energy replenishment for extending the battery 
life and prolonging the overall lifetime of sensor networks. For 
networks with replenishment, conservative energy expenditure 
may lead to missed recharging opportunities due to battery 
capacity limitations, while aggressive usage of energy may result 
in reduced coverage or connectivity for certain time periods. 
Thus, new power allocation schemes need to be designed to 
balance these seemingly contradictory goals, in order to maximize 
sensor network performance. In this paper, we study the problem 
of how to jointly control the data queue and battery buffer to 
maximize the long-term average sensing rate of a wireless sensor 
network with replenishment under certain QoS constraints for 
the data and battery queues. We propose a unified algorithm 
structure and analyze the performance of the algorithm for all 
combinations of finite and infinite data and battery buffer sizes. 
We also provide a distributed version of the algorithm with 
provably efficient performance. 

I. Introduction 

Wireless sensor networks have been widely used in mon- 
itoring [1], maintenance [2], and environmental sensing [3]. 
The lack of easy access to a continuous power source and 
the limited lifetime of batteries have hindered the wide-scale 
deployment of such networks. However, new and exciting 
developments in the area of renewable energy [4] [5] provide 
an alternative to a limited power source, and may help alleviate 
some of the deployment challenges. These renewable sources 
of energy could be attached to the nodes and would typically 
provide energy replenishment at a slow rate (compared to the 
rate at which energy is consumed by a continuous stream of 
packet transmissions) that could be variable and dependent on 
the surroundings. 

Energy management in networks equipped with renewable 
sources is substantially and qualitatively different from energy 
management in traditional networks. For example, conserva- 
tive energy expenditure could lead to (i) very long delays 
because the energy is not being fully used to transmit at high 
enough data rates, and (ii) missed recharging opportunities 
because the battery buffer is full. On the other hand, an over 
aggressive use of energy may lead to lack of coverage or 
connectivity for certain time periods. Further, if the battery of a 
node discharges completely, it could be temporarily incapable 
of transferring time-sensitive data. This may have undesirable 
consequences for many applications. Thus, new techniques and 



protocols must be developed for networks with replenishment 
to balance these seemingly contradictory goals. 

In this paper, we first consider a single link communication, 
in which the transmitting node has a data buffer that holds 
the incoming variable-rate sensing data and a battery buffer, 
which is being replenished at a variable rate. In our model, we 
allow any combination of finite and infinite data and battery 
buffer sizes. If the data buffer size is infinite, the concern is 
the stability of the data queue, while for finite data buffer, 
excessive data losses is the undesirable event. Likewise for 
the battery buffer, frequent occurrences of battery discharge 
should be avoided. We investigate the problem of maximizing 
the long-term average sensing rate, subject to the constraints 
on the stability of the data queue (or the desired data loss 
ratio when the data buffer size is finite) and the desired rate 
of visits to zero battery state. We provide a simple and unified 
joint rate control and power management framework, and show 
that the performance of our scheme is close to optimum. 
We then extend our algorithm to the multihop network with 
multiple source destination flows under the primary or node- 
interference model that has been shown to be a good model 
for Bluetooth of FH-CDMA systems. 

While the problem of energy management in sensor net- 
works has seen considerable attention, there have been few 
works [6], [7], [8], [9], [10], [11], [12], [13] that also include 
energy replenishment. In [6], the authors consider the problem 
of dynamic node activation in rechargeable sensor networks. 
They provide a distributed threshold policy that achieves a 
performance within a certain factor of the optimal solution 
for a set of sensors whose coverage area overlap completely. 
In [7], the authors study the problem of computing the 
lexicographically maximum data collection rate for each node 
such that no node will be out of energy. In [8], the authors 
consider the problem of energy-aware routing with distributed 
energy replenishment. They provide an algorithm that achieves 
a logarithmic competitive ratio and is asymptotically optimal 
with respect to the number of nodes in the network. Prior work 
on power allocation for wireless networks without replenish- 
ment has been widely studied, e.g., [14], [15], [16]. In [15], 
[16] the authors assume that the data buffer is large enough 
so that packet loss does not occur and provide a dynamic 
programming based solution. In [14], the authors develop 
approximate algorithms to minimize the average allocated 
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power, or maximize the throughout given average power 
constraint, and at the same time keep the data queue stable. 
The approximation improves at the cost of increasing the data 
queue length and queueing delay. Most of these works assume 
a constant energy supply, i.e., there is no battery issue. The 
idea of [12] and this paper are motivated by [14]. We both 
model energy replenishment and consider jointly managing the 
data and battery buffers. This coupling between the data and 
battery buffers is what makes the problem notoriously difficult 
to solve using standard optimization based approaches. For 
example, unlike [14] that utilizes the fact that a static allocation 
policy is optimal, it is not even clear whether a static policy 
would even be optimal in our setting. Specifically, with a 
battery buffer, there is an additional energy constraint that 
the allocated energy should be within the battery state, and 
this constraint is even more difficult than the average power 
constraint. In [12], the authors consider infinite data buffer and 
finite battery buffer sizes. They assume that the replenishing 
process is i.i.d (for non i.i.d process, they claim that a K slot 
analysis can be applied, which is complex and depends on the 
network size), and show that the probability of battery state 
being less than the peak power or close to the full battery state 
vanishes as the battery size grows, under an index policy. In 
this paper, we construct a framework that accommodates all 
combinations of finite and infinite data and battery buffer sizes 
by defining minimum number of virtual queues in a general 
format. In addition to the constraint on the stability of the data 
queue (constraint on the data loss ratio when the data buffer 
size is finite), we also have a constraint on the frequency of 
battery discharge. Rather than assuming an i.i.d replenishing 
process as in [12], we allow for a general replenishment 
process without assuming ergodicity and carefully design the 
virtual queues and explore the relations between actual queues 
and virtual queues to show that our algorithm is efficient. 
Further, we design a distributed algorithm in detail that is 
based on the notion of imperfect scheduling first developed 
in [17]. 

The main intellectual contributions and challenges of the 
paper are as follows: 

• We formulate a sensing rate maximization problem with 
QoS constraints on both the data and battery queues. Due 
to the coupling between the battery and data buffers, a 
stationary policy could be suboptimal, hence traditional 
resource allocation techniques do not directly apply. 
Further, dynamic programming based solutions result in 
prohibitively high complexity, even for the single-link 
scenario. Nonetheless, we are able to develop a simple 
and unified framework for all combinations of finite and 
infinite data and battery buffer sizes, and our algorithm 
is provably efficient. 

• We extend the algorithm from a single-link case to 
a multi-hop network under node-exclusive interference 
model with practical settings and develop an efficient 
distributed algorithm based on imperfect scheduling. 



II. Single Link Model 

We first consider a single link control model in this paper, 
as illustrated in Figure 1. We assume a time slotted system and 
in time slot t, the amount of data available for the sensor node 
to sense is denoted by A(t), which is upper bounded by A max , 
(0 < A max < oo). In the same time slot, the actual amount 
of data the node senses and places in the data buffer is R(t). 
The amount of energy expended by the node at time t for 
data transmission is P(t), which is upper bounded by P pe aic, 
(0 < Ppeak < oo) and the achievable data rate at that power 
level is /i(P(t)), where we assume /i(-) to be monotonically 
increasing, reversible and differentiable on the half real line 
3? + 1J{0}. The node has a battery of size B b (either B b < 
oo or B b = oo) with zero initial battery state. We let r(t) 
denote the amount of replenishment energy arriving to the 
node at time t. The energy state of the battery and the data 
state of the data buffer at time t is given by q b (t) and qd(t), 
respectively. The data buffer has size B d (again, either Bd < 
oo or Bd = oo). Under this setting, we describe the objective 
in the following section. 
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Fig. 1. Single Link Control Model 



A. Problem Formulation 

Our general objective is to maximize the long-term average 
sensing rate, subject to the QoS constraints on both data and 
battery queues: 



(A) 



1 T_1 

maxliminf — > R(t) 
P,R T^oo T ^ w 



t=o 



s.t. qd(t + 1) = min 



(q d (t)- p(P(t))) + + R(t),B d 



q b (t + 1) = min \q b (t) - P(t) + r(t),B b 
0 < R(t) < A(t), 
0 < P(t) <min [<? 6 (t),Ppeak 
Po < Vo, 



1 T_1 

limsup - V" q d {t) < oo (B d = oo), or 
Pd < Vd (B d < oo), 



(1) 
(2) 
(3) 
(4) 
(5) 
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where (•)+ = max[-, 0], R = {R{0), R(l), . . . , R{T- 1), . . .} 
is the actual sensing data vector, P = {P(0), P(l), . . . , P(T- 
1), . . .} is the allocated power vector, and 



Pd = lim sup 



p 0 = lim sup 

T^oo 



(7) 



(8) 



are the long-term data loss ratio with an upper bound 
and the frequency of visits to zero battery state with given 
threshold rj 0 respectively, where 

D(t) = ((q d {i) - fi(P(t))Y + R(t) - B d ^j , (9) 



^o(^) — 1 battery state '0' is visited from higher states in slot t 

0 if P{t) = 0 or P(t) < q b {t) 

1 otherwise 



(10) 



are the amount of data loss in slot and the indicator that the 
battery discharges completely in time slot t, respectively. Note 
that, if the battery is completely discharged at time t and there 
is no replenishment at slot t, we do not consider a complete 
discharge event occurring at time t + 1 as well. Further note 
that, we do not assume ergodicity of the system parameters, 
but if they are ergodic, then p Q represents the actual probability 
of a complete discharge event as t — > oo. 

In Problem (A), Constraints (1) and (2) describe how the 
data and battery queues evolve, respectively. Especially, if 
Bd = oo, (1) can be simplified to 



Qd(t+1) = (q d (t) - n{P(t))) 
and if Bb = oo, (2) can be simplified to 



R(t), 



q b (t+l) = q b (t)-P(t) + r(t). 



(2') 



Constraint (3) bounds the actual amount of sensed data R(t) 
by the amount of available data A(t) in slot t. Constraint 
(4) states that we cannot oversubscribe the energy that is 
unavailable in the battery nor can we exceed the peak power 
level. Constraint (5) is the battery QoS constraint i] 0 of 
the desired battery discharge rate. Constraint (6) is the QoS 
constraint for data queue: if Bd = oo, we need to keep the data 
queue stable, and if Bd < oo, the data loss ratio is required 
under a given threshold r]d- 

Problem (A) has an inventory control structure, and typically 
such a structure can be solved optimally using dynamic 
programming, albeit with high complexity. Furthermore, de- 
pending on the exogenous processes {A(t), t > 0} and 
{r(t), t > 0}, Problem (A) may not have a feasible solution, 
i.e., there exists no power allocation policy {P(t), t > 0} 
that satisfies all constraints simultaneously. In this paper, our 
purpose is to develop a simple algorithm, which performs 
arbitrarily close to the performance of the optimal power 
allocation, whenever a feasible solution exists. To achieve that 
purpose, we do the following: 



• define virtual queues for both the data and battery buffer 
to avoid the difficulties involved in dealing with the data 
loss and battery discharge probability directly. We show 
that keeping the virtual queues stable ensures that the 
constraints on pd and p a are met. 

• design a power allocation scheme based on simple index 
policies and show that our scheme keeps both virtual 
queues stable and at the same time performs arbitrarily 
close to the optimal performance. 

• generalize our algorithm to the multihop scenario and 
develop distributed algorithms. 

B. Virtual Queues 

We define qd and q b as the virtual data and battery queues. 
The virtual queues evolve according to the following Lindley's 
queue evolution equations: 

Qd(t + 1) = ((&(*) - VdR(t)) + - »(P(t)) + R(t) + + , 

(11) 

q b {t + 1) = ((q b {t) -Vo) + + P{t) - r{t) + M{t) + i 0 (t)) + , 

(12) 

where I(t) — (fj,(P(t)) — qd{t)) + is the amount of transmitted 
idle packets when there is no enough data to transmit using 
the allocated energy, M(t) = {q b {t) - P(t) + r(t) - B b )+ is 
the amount of missed replenishing energy due to full battery 
when B b < oo, and l 0 (t) is defined in Equation (10). Note 
that if B b < oo, M(t) = 0 and Equation (12) reduces to 

(t + l)= ({q b (t)-Vo) + + P(t)-r(t) + l 0 (t)y. (13) 

Without loss of generality, the initial state q b (0) and q~d(fy 
can be set to be zero. The following proposition shows that 
if the virtual queues qd(t), q b (t) and the actual battery queue 
q b (t) are all strongly stable, pd and p a are guaranteed to meet 
their constraints. 

Proposition 1: If the virtual queues Qd{t), q b (t) and the 
actual battery queue q b (t) are both strongly stable, i.e., 

1 T_1 

limsup - V q d (t) < oo, 

T-l 



96 



limsup I- Qb{t) < oo, 



1 



t=o 

T-l 



limsup ^ V q b (t) < oo, 
then p d < r]d and p 0 < rj 0 . 

The proof of this result can be found in Appendix A. Next, 
we present our scheme. 

III. Joint Rate Control and Power Allocation 
Algorithm 

A. Algorithm 

The algorithm consists of two components: a rate control 
component and a power allocation component. Both compo- 
nents are index policies, i.e., the solutions are memoryless 
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and they depend on the instantaneous values of the system 
variables. 

Rate Control (RC): 
We define 0 < V < oo to be the control parameter of our 
algorithm. Let Q d {t) = qd(t) when Bd = oo, and let Qd{t) = 
q~d{t) when B d < oo. If Qd(t) < \, the transmitting node 
chooses to sense all the available data, i.e., R(t) = A(t). 
Otherwise, R(t) = 0. 

Power Allocation (PA): 
Solve the following optimization problem 



max Q d (t)fi(P(t))-q b (t)P(t), 
P(*)en(t) 



(14) 



where U(t) = {P(t) : 0 < P(t) < mm[q b (t), P pea k]} is a 
compact and nonempty set. Allocate P{t). 

The set, II(t) of possible power allocations guarantees 
Constraint (d) on P(t) in Problem (A). If /i(-) is concave , the 
objective function is a concave function of P(t). Consequently, 
PA solves a simple convex optimization problem in each time 
slot. The positive term Qd(t)fi(P(t)) can be viewed as the 
utility of allocating power P(t) and the term q b (t)P(t) can 
be viewed as its associated cost. When the control parameter 
V is chosen to be large, Qd{t) tends to be large according to 
RC, and PA tries to allocate higher power P(t) to increase the 
utility, whereas, when the virtual battery queue length q b (t) is 
large, PA avoids allocating a high amount of power to reduce 
the cost. Thus, this index policy of PA can be viewed as a 
greedy profit maximization scheme. 

B. Performance Analysis 

Recall that A(t) is the available amount of sensing data 
and R(t) is the actual amount of data the transmitting node 
chooses to sense. Clearly, using the rate controller, we make 
sure that the data queue remains within a certain bound and 
this has a positive effect on the battery as well, since a certain 
portion of the data packets are not allowed into the transmitting 
node. The natural question one would ask here is, whether our 
rate controller rejects too many packets in the first place to 
synthetically meet the constraints. In the following theorem, 
we show that this is not the case. Indeed, if there exists a 
solution, A* = liminfV^oo ^ J2t=o ^*(*) to Problem (A) for 
the exogenous process {A(t), t > 0} and the replenishment 
process {r(t), t > 0}, then the sensing rate associated with RC 
and PA can be made asymptotically closer to A* by increasing 
the control parameter V with increasing B d and B b . We use 
the notation y = O(x) to represent y going to 0 as i goes to 
0. 

Theorem 1: If the following conditions hold: 

(1) ju(-) is concave on 3? + 1J{0}, and its slope at 0 satisfies 1 

0 < (3 = m'(0) < oo, 

(2) r{t) > 0, for all t > 0, and f = 
limsup-j^ i J2j=o r(t) < P pea k, 

1 For instance, consider the additive white Gaussian noise channel 
capacity, fJ.(P) = log(l + P/No), where No is the two sided noise power 
spectral density. 



(3) A feasible solution to Problem (A) exists and the optimal 
instantaneous sensing rate is R*(t), 

then the joint power allocation and admission control algo- 
rithm (with RC and PA) achieves: 

Qd(t) < ^+A m . dX , Vt>0 (15) 
q b (t) < [3 + A m . dX Y Vt>0 (16) 



q b (t) < oo, V t > 0 



(17) 



T-l 



T-l 



lim inf - V R(t) > lim inf - V R* (t) - Of — v 



t=0 



t=0 



where ^ max 
rate, and 



-Vo(^ + l3)-g(V,B d ,B b ), (18) 
M^peak) is the upper bound for the transmission 



g(V,B d ,B b ) = 



0( 

oC 



V 

(¥-B d )+- 



if B d < oo, B b < oo 
if Bd = oo, B b < oo 
if B d < oo, B b = oo 

if B d < oo, B b < oo. 

The proof of Theorem 1 can be found in Appendix B. 
From Equation (15), Equation (16), and Equation (17), the 
virtual queues q d (t), q b (t) and the actual battery queue q b (t) 
are all strongly stable. Thus, by Proposition 1, Pd < Vd an d 
Po < Vo- In Theorem 1, V is a finite tunable approxima- 
tion parameter that controls the efficiency of the algorithm. 
Observe Equation (18), which compares the performance of 
our algorithm with that of the optimal solution of Problem 
(A), the term rj 0 (fi mRX + j3) captures the influence of battery 
outage, and it is small since the battery outage threshold rj a is 
usually set to be very small to avoid network disconnection. 
Function g(V, Bd, B b ) represents the asymptotical property of 
the gap. If both battery and the data buffers are of infinite 
size, then the performance gap is identical to 0. Otherwise 
the performance gap is dictated by the finite (or the smaller) 
one of the buffers. One can observe that, by appropriately 
choosing the ^-parameter, we can make the performance gap 
decay inversely proportional to the buffer sizes. 

IV. Multihop Network Model 



i( Awo A 2 (o^Jk^)) 





Fig. 2. Multihop Network Model 

We consider a multihop wireless sensor network with N 
nodes and L links as illustrated in Figure 2. Each node 
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n £ N = {1,2, ...,N} is attached to power sources for 
replenishment. Let A n (t) and Rn(t) denote the amount of 
available data for sensing and the actual amount of sensed 
data, to node n that are destined to node e in slot t. We 
assume that each node n maintains an infinite data buffer 
with state q^' e (t) for flows destined to e, and also maintains 
a finite battery buffer with size B% and state (We focus 

on infinite data buffer and finite battery buffer in order to 
emphasize the difference from single link to multihop. For 
other combinations of data and battery buffer sizes, then 
extension can be made similarly without too much effort). 
Let r n (t) denote the replenishment at node n in time slot 
t. The transmit power is chosen to be Pi(t) over link I. In 
the formulation, we assume that the power the receiving node 
consumes to receive and decode the packet is identical to 
Pi(t) as well. The sole reason for this is simplicity and the 
generalization to the asymmetric case is straightforward. We 
use the node-exclusive interference model. Under this model, 
a node can only receive from or transmit to at most one node 
at any time slot. In each time slot t, with the assigned power 
Pl(t), the achieved data rate at link I is m(Pi(t)) in that time 
slot, where the rate function /ij(-) is a non-decreasing, concave 
and differentiable function satisfying m(0) = 0. Let p™ be 
the frequency of visits to the zero battery state for node n. 
Let fi„ and ©„ denote the set of directed links originated 
from node n and terminate at node n, respectively. We say 
P = [Pi (t) • • • P L (f)] satisfies the node-exclusive model if 
Pi(t) > 0 for some I e Q n U @„, then P v = 0 for all 
V € (fi n U 9 n ) \ {I}- ln a multihop network, we formulate the 
joint queue and energy management problem as follows: 

1 T_1 

(B) maxUmrnf-^ ]T R e n (t) 

P ' R t=0 n.eeJ^ 

s.t. P(t) satisfies the node-exclusive model, (19) 



# e (* + !) < (# e w E tf{m))) + + Kit) 



(20) 



lE@„ 



«f (*) - E W)+r n (t),B£ 
;eo„ue„ 



qt(t+l) = min 

0 < E Pl (*) ^ lb (*) , 0 < Pi (t) < P peak , 
zefi„ue„ 

AT 

5>f(i5(t))= w (fl(t)), i£(t)<^(t), 

e=l 

1 T_1 

limsup- ^ ^' e (i) < oo, n^e, 



T^oo 



Po < no, 



t=0 



(21) 
(22) 

(23) 

(24) 
(25) 



where jj™ is the desired upper bound for p", is the 

power assignment vector for all links in slot t, P is the power 
assignment for all links over all time slots, and R is the actual 



sensing data vector for all node-destination pairs over all time 
slots. In Problem (B), the objective is to maximize the long- 
term average total sensing rate for all nodes destined to all 
destinations. 

In Problem (B), (19) is the interference constraint. Con- 
straints (20) and (21) describe how the data and battery queues 
evolve, respectively. Note that the destination node of each 
flow does not need to maintain a data buffer for that flow, as 
indicated in (20). Constraints (22) are the energy conservation 
equations stating that we cannot oversubscribe the energy that 
is unavailable in the battery nor can we exceed the peak power 
level. Constraints (23) are the rate conservation equations that 
bound the actual amount of sensed data R n (t) by the available 
amount of data A n (t), and share the transmission rate of a link 
among all the destinations in slot t. Constraint (24) is the QoS 
constraint for data queue: we need to keep all the data queues 
stable. Constraint (25) is the battery QoS constraint of the 
desired battery discharge rate jj™. 

Similarly, we define virtual queues for all n e Jt 



#(* + !)= ((#(*) -»£ 

+ M n {t) + l n 0 {t)) + , 



W)-r n (t) 

ien n ue„ 



(26) 



where M n (t) = (q£(t) - E ;e o„ue„ W + r n (t) - B£) + 
is the amount of missed replenishment and 

7™(t) = indicator that battery state '0' is visited from higher 
states in slot t for node n 

0 if E ief2 „ u e„ PiW = 0 or £*en„ u e „ Pi (*) 
< 

1 otherwise 

Next, we generalize the main results and algorithms RC and 
PA to the multihop scenario. 

Corollary 1: If all the virtual battery queues qg(t), Vn e N 
are strongly stable, we have < 77™, Vn € N. 
The proof is identical to the single hop scenario and can be 
found in Appendix C. We give the algorithm in the following 
section. 

V. Joint Rate Control, Power Allocation and 
Routing Algorithm for Multihop Networks 

The joint rate control, power allocation and routing algo- 
rithm for multihop networks can either be implemented in a 
centralized or distributed manner. For the centralized solution, 
we use the classical Maximal Weighted Matching (MWM) 
based algorithm and for the distributed algorithm, we can use 
the Maximal Matching (MM) based algorithm [17]. 

Multihop Rate Control (MRC): 
Depending on whether Maximum Weight Matching or Maxi- 
mal Matching is employed by the scheduler, there is a slight 
difference in the implementation of MRC: 
Maximum Weighted Matching (MWM): If <?£' e (i) < \ , node 
n chooses to sense all the available data packets, i.e., R n (t) = 
A^(t); otherwise, reject all the arrivals, i.e., R n (t) = 0. 
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Maximal Matching (MM): If q^' e (t) < V , node n chooses 
to sense all the available data packets, i.e., i?^(t) = A^(t); 
otherwise, reject all the arrivals, i.e., ii^(t) = 0. 

Multihop Power Allocation (MPA): 
In order to ensure the stability of the actual and virtual queues, 
we try to make no node transfer data of a flow to a relay node 
that is not the destination of that flow unless the differential 
backlog of that flow is no less than a fixed value 7. 

Let tran(Z) and rec(7) denote the transmitting and receiving 
node of link I, respectively. We first define 



7; = 



7 if rec(Z) 7^ e 
0 otherwise 



where 7 > 0 is some constant. Let ei(t) = 
argmaxg |q*' in< ''^ e (i) — q r ^°^' e (t) — jf\ be the flow 
on link I that has the maximal modified differential backlog, 



tran(0,ei(t) 



and Wi{t) = max 
For each link I, solve 

max wi(t)m(Pi(t)) 



,rec(J),ei(t) 



(*) - £ 



it) -it 



where 11/ (t) 

tran(Z) 



(t),C (,) (*).*W 

node can transfer a flow to 



(27) 

{Pi(t) : 0 < Pi{t) < 

j. This ensures that no 
a relay node that is not the 



destination of the flow unless the differential backlog of the 
data queues between these two nodes is no less than 7. 
Let Pi(t) be the solution. With the calculated power Pi(t), 

let W t (t) = Wl (t)M(Pt(t)) be 
the weight on link I. 

For the whole network, we either use the MWM or MM 
as described below. We will analyze the performance of our 
solution when using each algorithm. 

Maximum Weighted Matching Algorithm: link I has weight 
Wi(t), then the weight of a matching M is Wj^{t) = 
^2i e jftWi(t). The network chooses a maximum weighted 
matching in a centralized manner, the links in the cho- 
sen matching become active with the calculated transmitting 
power, and other links are not activated. 
Maximal Matching Algorithm: the network chooses a maximal 
matching in a fully distributed manner as in [17], the links 
in the chosen matching become active with the calculated 
transmitting power, and other links are not activated. 

Multihop Routing: 
When wi(t) > 0, transmit for flow that is destined to 
e;(t) with rate i.e., ^ i(t) (P,(t)) = m(P t {t)) and 

Hf(W)) =0- V e9 6e,(t). 

Note that MRC and routing can be done by each node 
independently. We give our main theorem for the multihop 
scenario: 

Theorem 2: If 

(1) yU;(-) is concave on 3? + 1J{0}, and its slope at 0 satisfies 

0 < p = mK°) < 00, e £, 

(2) Vn e N: r n (t) > 0, Vi > 0, and f„ = 
limsupy^^ ^ Eti) 1 r ni t ) < p p^k, 



(3) A feasible solution to Problem (B) exists and the optimal 
instantaneous sensing rate vector is R*(t), 

then the maximum weighted matching based joint rate 
control MRC, power allocation MPA, and routing algorithm 
achieves: 



Qb 



(t)<p( 



V 



(28) 
(29) 



n \ t=0 e ) 

^ E VTJ^ T^E Kit) ^(Mmax + /?) 
n I °° t=0 e 



- 0 ,(l^,l-o,i,, 



(30) 



and the maximal matching based joint rate control MRC, 
power allocation MPA, and routing algorithm achieves: 



C e (t)<V + A m . dX , 
«?(*) <P(V + A msK ), 



(31) 
(32) 



t=o 

T-l 



^ E ^ E E ^ - 'I'yi'" ' 

n \ t=0 e 

-o(<^£)}-o ( I). 



(33) 



The proof of Theorem 2 can be found in Appendix D. The 
results can be interpreted similarly as Theorem 1. 

VI. Numerical Example 

We consider a network topology, shown in Figure 3(a). 
There are 6 nodes, 7 links, and 2 flows with source-destination 
pair (3,1) and (5,2), respectively. There are T — 10 6 time 
slots, each of which is 10 sees long. We use the rate power 
function ^(P ; ) = 101og 2 (l + packets/slot V7 G L. Let 
the power of the background noise = 1.6 x lO -14 !^ V/ € 
L, and the channel gains gi = 1.6 x 10~ 13 , VZ € £. 
Each node is equipped with an infinite data buffer for each 
flow through it, and a battery buffer of size B\, = 800 J. 
The number of arrivals A^(t), t > 0, for all nodes and 
flows, are modeled as independent Poisson random variables 
with mean 20 packets/slot and A m . dx = 31 packets/slot. The 
replenishment process is a random process that is a periodic 
deterministic waveform plus independent Gaussian noise, as 
shown in Figure 3(b) (The cycles imitate the daily solar cycles 
for a solar battery). We also set 77™, the threshold of battery 
outage probability to 0.03 for all n £ K. 

We choose different values of the control coefficient V 
for the proposed algorithm and compare the results with the 
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source 2 



source 1 




destination 2 destination 1 
(a) Network Topology 



5000 10000 15000 

T(slot) 

(b) Replenishment Process 



Fig. 3. Settings 



optimal value 2 . From Figure 4(a), we see that as V increases, 
the average total sensing rate keeps increasing and gets closer 
to the optimal value, which is consistent with Equation (30). 
From Figure 4(b), we see that as V increases, the average data 
queue length (we here only plot the data queue length of node 
2 for flow 1) keeps increasing but is upper bounded by the 
bound we get in Equation (28). From Figure 4(c) we observe 
that the battery discharge probability (we only plot for node 
1) increases to the threshold as V increases. Thus, increasing 
V can increase the average total sensing rate, but the cost is 
increasing the average data queue length and battery discharge 
probability of at least one node. 




v 

(a) 





Fig. 4. Performance of the MWM based algorithm. Impact of the control 
parameter V on (a) the average total sensing rate, (b) average data queue 
length, and (c) the battery discharge probability 



Similarly, from Figure 5(a), we see that as V increases, 
the average total sensing rate keeps increasing and gets closer 
to some value that is larger than half of the optimal value, 
which is consistent with the result given by Equation (33). 
From Figure 5(b), we see that as V increases, the average 
data queue length (we here only plot the data queue length 
of node 3 for flow 1) keeps increasing but is upper bounded 
by the bound we get in Equation (31). From Figure 5(c) we 
observe that the battery discharge probability (we only plot 
for node 6) increases to the threshold as V increases. 




v 

(a) 



Average Data Queue Length 
Upper bound 




-The optimal value can be obtained by using dynamic programming. The 
details are omitted here. 



V 
(b) 



Fig. 5. Performance of the MM based algorithm. Impact of the control 
parameter V on (a) the average total sensing rate, (b) average data queue 
length, and (c) the battery discharge probability 



VII. Conclusion 

In this paper, we studied the problem of energy management 
in rechargeable wireless sensor networks. Our objective was 
to maximize the average data sensing rate subject to QoS 
constraints on both data and battery queues. We provided a 
simple and unified framework of joint rate control and power 
allocation for all combinations of finite and infinite data and 
battery buffer sizes. We showed through both analysis and 
simulation that the performance of our strategy is close to 
that of the optimal solution. We extended our algorithm to 
the multihop scenario and showed that simple extensions of 
our index schemes for single hop generalize to the multihop 
scenario with a similar, close-to-optimal performance. We 
developed a distributed joint rate control, power allocation and 
routing algorithm for multihop networks under node-exclusive 
interference model based on imperfect scheduling. Although 
we do not consider channel variations in this paper, taking 



8 



channel variations into consideration is technically straightfor- 
ward. Our multihop network formulation does not consider the 
fairness among links, however, the opportunistic scheduling 
framework [18] used to resolve the fairness issue can be 
applied in this context. For future work, we are extending this 
framework to the general interference model where the rate 
power function is not concave. 
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Appendix 
A. Proof of Proposition 1 

Using the idea similar to [14], we have the fact that 
if any queue represented with Q(t) is strongly stable, 
then limsupy^^ = 0. Hence, if q d (t), q b (t) and 

q b (t) are strongly stable, then limsupj^^ = 0, 

limsupy^^ = 0 and limsupj^^ ^4p- = 0. 

From Equation (11), we have q d (t + 1) > qd(t) — rjdR{t) + 
D(t) - D(t) - fi(P(t)) + R(t) + I(t). Note that q d (t + 1) = 
q d {t) - fi(P(t)) + I(t) + R(t) - D(t). By adding from 0 to 
T — 1, dividing by T and taking limsupj^^ on both sides, 
we have 



,. Qd(T) . ,. &(0) 
iim sup — — — > iim 

T^oo 2 T^oo 1 



T-l 



lim sup 



1 T_1 

r)d lim sup - V R(t)+ 

Qd(0) - q d {T) 



t=0 



- lim 



Since lim sup T _ 
liniT 



1 I -^-OO rp 



Mil = 

oo T 

0 and limy. 



0, liiriT_ 

'OO j- 1 



M0) = 0, 
0, so we get 



< 



lid- 



p 0 = lim sup T ^ c 

Similarly, from Equation (12), we have q b (t + 1) > q b (t) — 
r]o + P(t)-r{t)+M(t) + l 0 {t). Note that q b (t + 1) = q b (t) - 
P(t) + r(t) - M(t). By adding from 0 to T - 1, dividing by 
T and taking limsupj^^ on both sides, we have 



lim sup — — — > lim 

T— >oo T ~ T^oo 



T-l 



— r7 0 + limsup- > l 0 (t) 

1 T — 1 t=o 

0,(0) - q b (T) 



+ lim 

Since limsup^^ = 0, lim^oo ^ : I). 

Hindoo = 0 and lim T 

Po = limsupy^ i J2^=o < Vo 



M91 

T ~ 

= 0, so we get 



B. Proof of Theorem 1 

Proof of Equation (15): Note that I(t) < (J,(P{t)) and 

R(t) < A max . The rate allocation unit RC is chosen to satisfy 
Equation (15). 

Proof of Equation (16): Since ^(-) is concave on 3?+ [J{0}> 
we have fJ,(P(t)) < (j,(0) + /3P{t) for P(t) e U(t), Vi > 0, 
where 0 < 0 = ^'(0) < oo. Then, Q d (t)/j,(P(t)) - 
q b (t)P(t) < Qd{t)fL{Q) + [3Q d {t)P{t)-q b {t)P{t) where P{t) 
is the solution of PA. 

If (3Q d {t)P(t)-q b {t)P(t) < 0,thenwegetQ d (tV(P(t))- 
q b (t)P(t) < Qd(t)/i(0). However, PA chooses P(t) that maxi- 
mizes Q d (t)n(P(t))-q b (t)P(t) which means Q d (t)n(P(t))- 
q b (t)P(t) > Q d (t)ii(0) since 0 e II(t). Then we must have 
PQd{t)P{t) - q b {t)P{t) > 0, i.e., pQ d {t)P{t) > q b (t)P(t). 
Thus, if P(t) > 0, q b (t) < PQ d {t) < + A m , x ); if 

P(t) = 0, by Equation (10), l 0 (t) = 0, and we also have 
M(t) < r(t), then q b (t) does not increase anyway. Therefore, 
Qb{t) < @{\ + A m . dx ) for all t, which is Equation (16). 
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Proof of Equation (17): Equation (17) is trivial when B b < 
oo. We only need to consider the scenario when Bb = oo. 

First, we provide a rough idea of the proof: by exploring 
the relations between %{t) and q b (t), we notice that as q b (t) 
increases from 0 to at most r max + /3(-^ + A mdx ), q b (t) will hit 
zero at some slot. Once q b (t) becomes zero, PA either drains 
the battery queue or allocates P pe ak- Since P pea k > r, Qb{t) will 
either be under r max or start to decrease after finite number of 
slots. We now give the proof details. 

Without loss of generality, let q b (0) = 0. We have the 
following cases: 

i) if P(t) > r(t), l a (t) = 0 and q b (t) > 0, then q b (t + 1) < 
q b (t) and q b (t + 1) - q b (t) < q b (t) - q b (t + 1), i.e., even if 
q b (t) increases, the increment is no larger than the decrement 
of q b (t); 

ii) if P(t) < r(t), l 0 {t) = 0, then if q b (t +1) > 0, 
q b (t)—q b (t+l) > q b (t+l) — q b (t), i.e., the decrement of q b (t) 
is no less than the increment of q b (t), else if q b (t + 1) = 0, it 
goes to case iv); 

iii) if l 0 {t) = 1, then q b (t+l) = r(t) < r max by definition of 
discharging event Equation (10); 

iv) if q b (t) = 0, by Equation (14), PA chooses P(t) = 
min[g 6 (i), Ppeak], then either q b (t + 1) = r(t) < r max , or 
q b (t + 1) = q b (t) — Ppeak + r(t). For the latter case, if 
Ppeak > r(t), then the battery queue state decrease and 
q b (t+l) = q b (t)-q b (t+l) = P peak -r(i) > 0; if P pea k < r(t), 
then q b (t) — 0 and case iv) continues. However, P pea k > f, 
which means after at most K < oo slots, r(t + K) < P pea k 
and the battery queue state starts to decrease. 

Combining the above discussion with Equation (16), we 
have that Vi > 0, there 3t 1 {t),t 2 {t) > 0 such that 

q b (t) < min[ gfc (0),r max ] + \q b (h(t)) - q b {t 2 {t))\ 

+ if (r maI — Ppeak) 

V 

<{K + 1) 

^max ~t~ P( 0 ~t~ ^max) ^ 00 > 

which is Equation (17). 

Proof of Equation (18): We define the Lyapunov function 

L(Q d (t),q b (t)) = Q 2 d (t) + q 2 b (t), and A(Q d (t),q b (t)) = 
L(Q d (t + l),q b (t + 1)) - L(Q d (t),q b (t)). 
I) B d = oo and B b = oo. 

From Equation (13), we have q b (t + 1) < (q b (t) — r] D ) 2 + 
(l 0 (t) + P(t) - r{t)f + 2{q b {t) - Vo ) + (h(t) + P{t) - r(t)). 
Also from the data queue dynamics, we have q\(t + 1) < 
qj(t) + v 2 (P(t)) + R 2 (t) + 2q d (t)R(t) - 2q d (t)^(P(t)), then 

A = A(q d (t),q b (t)) 

< ^ 2 {P{t)) + R 2 (t) + 2q d (t)R(t) - 2q d (t)ti(P(t)) + (1+ 
Ppeak) 2 + r, 2 nax + T] 2 0 + 277 0 r max + 2q b (t)(l 0 (t) + P(t)- 
r(t)) 

< (1 + Ppeak) 2 + vl + r 2 max + 2?7 0 r max + fi 2 m . dx + A 2 m . dx (t)+ 
VR(t) + 2q b (t)(l 0 (t) - r(t)) + 2 [q d (t) - V/2] R(t)- 
2[q d (t) f i(P(t))-q b (t)P(t)]. 



It is apparent that RC is trying to minimize the term 
[q d (t) — V/2] R(t), and PA is trying to maximize the value 
of the term [q d (t)fj,(P(t)) - q b (t)P(t)]. Since the optimal 
solution for Problem (A) may not be unique, we let CP* 
be the optimal solution set and P* e CP* be any optimal 
solution, for Problem (A). In time slot t, let P m (t) be 
the value that maximize the unconstrained objective func- 
tion q d (t)n(P(t)) - q b (t)P(t). Since U{t) = {P(t) : 
0 < Pit) < mm[q b (t), P pea k]}, only when q b (t) < P pea k 
and P m (t),P*(t) i. U(t), we may have q d (t)fi(P(t)) - 
q b {t)P{t) < q d {t)fi{P*{t)) - q b (t)P*(t). Thus, we have 

A < (1 + Ppeak) 2 + ril + r 2 m . dx + 2 Vo r m . ix + ^ + A 2 m . dx + V( 
R(t) - R*(t)) + 2q d (t)[R*(t) - ti(P*(t))] + 2q b (t)l 0 (t) 
+ 2q b (t)[P*(t) - r(t)] + 2[q d (t)(n(P*(t)) - fi(P(t))) + 
q b (t)(P(t)-P*(t))]l 

When P m (t) i U(t) and q b (t) < P peak , PA will allocate 
q b (t) amount of energy for transmission in order to maximize 
q d (t)n(P(t))-q b (t)P(t) within U(t). Under this situation, we 
must have l Q (t) = 1 since r(t-l) > 0. Further, P*{t) <£ Tl(t) 
means P*(t) > P(t). Thus, 

A<(1 + Ppeak) 2 + Vo + r max + ^Vo r max + A*m ax + ^-m a x + ^( 

R(t) - R*(t)) + 2q d (t)[R*(t) - n(P*(t))] + 2q b (t)[P*(t) 
- r(t)] + (V + 2A mdX )(/3 + Mmax)io(t). (34) 

Lemma 1: Under the same conditions as those in 
Theorem 1, there exists an optimal policy under 
which the actual battery queue is strongly stable, i.e., 

limsupr^ ^ J2l=o 9&0) < °°- 

Proof: Suppose there is an optimal policy under which 

lim supy^^ ^ J2j=o ll (*) = 00 • This means there exists 
a subsequence of times {t n } such that t n — > oo and 

h SfcO 1 Ibit) °°- In 0ther WOrds ' V( 9 > °> 3t N G {tn} 

such that Vtfe E {t n } and t k > t N , we have j- J^t'Lo 1 > 
Q. 

Let AT(i) = min{n : \ r ( f ) = f + e (*)}> where 

|e(i)| > 0 can be chosen arbitrarily to ensure 0 < f + e(i) < 
Ppeak and N(i) < oo, \/i > 0. Let Q* = maxi> 0 N(i) [r + 
e(i)] < oo. 

Find Q and the associated such that ^ X^t^o^ 1 > 
Q > Q*. Since t N < oo, 30 < T 0 < t N - 1 such that 
ql(T Q ) > Q > Q*. Then, if we modify this optimal policy 
by keeping P*(t) unchanged Vf < T 0 , and letting P*(t) = 
f + e (t) = r + e(T 0 ), for T Q < t < T 0 + N(T 0 ), P*(t) = 
f + e(t) - f + e(T 0 + iV(T 0 )), for T 0 + N(T 0 ) < t < T 0 + 
N(T 0 ) + N(T Q + AT (T 0 )) and so on. This ensures that the 
battery state will equal the starting state ql(T 0 ) after N(T 0 ) 
slots and battery discharge does not happen in between. This 
process continues, as shown in Figure 6. Thus, we will not 
violate the constraint on the battery discharge ratio. 
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Fig. 6. Battery State Evolution under the Modified Optimal Algorithm 



Under the modified policy, we have 



1 T_1 
liminf- £>(P*(t)) 

t=o 

"T 0 -l T-l 

5>(p*(f)) + 5>(r+ e (t)) 

. t=0 t=T„ 



T-*oo T 

liminf — 

T^oo T 



/i(r + e) 
(35) 



By Jensen's inequality, we have 



T-l 



T-l 



t=0 



limsup i £ < M(limsup i £ P*(i)) 

1 T_1 

< ^(limsup - V r(t)) = /j,(r), 

T^oo J — 



4=0 



which means the feasible optimal sensing rate is strictly 
less than the maximum achievable service rate n(f). From 
Equation (35), we can choose e(t), Vt > 0 such that the 
modified policy provides a service rate which can support 
the original optimal sensing rate. In addition, we want to 
ensure that 0 < f + e(i) < P pea k and N(i) < oo, Vz > 0 
as indicated before. A sequence e(t), Vt > 0 that satisfies 
these conditions exists since liminfr^oo ij* Y^t=o r M = 
then the data queue is still strongly stable under the modified 
policy. Thus, the modified policy is still optimal. 

Further, under the modified optimal policy, g£(t) < 
^maxPo < oo, Vt < T 0 , and q£(t) < r max T 0 + Nr m!lx < 
oo, Vi > Tq, where N = maxj>r 0 N(i) < oo. Thus, the 
battery queue is strongly stable under the modified optimal 
policy, i.e., limsupy^^ ^ J2t=o < °°- As a byproduct, 

limsupy^ ^4p- = 0. ■ 

Note that q£(t + 1) = - P*(i) + r(t) for the optimal 
policy. By multiplying q b (i) on both sides and rearranging 
terms, we obtain q b (t)[P*(t) -r(t)] = -gj(f + l)]. 

Further, -r max < < /3(y + Anax)- With Lemma 1, 

by summing from 0 to T — 1, dividing by T and taking 



liminf y^oo, we have 



T-l 



t=o 

g 6 (0)g 6 *(0)-g 6 (T)g 6 *(T) 



lim inf ■ 

T^oo 



T 



1 

+ ijminf - V qZ(t)(q b (t) - q b (t 

1 — >oo 1 * — * 

t=l 

1 T 

< (1 + Ppeak) lmi inf = £<Z 6 *W, 



-1)) 



(36) 



which is a finite constant that is not related to V. 

Note that q*(t + 1) - (<£(t) - n(P*(t)))+ + R*(t) > 
q*t(t)-fi(P*(t)) + R*(t). By multiplying both sides with q d (t) 
and rearranging terms, we obtain qd(t)(R*(t) — fi(P*(t))) < 
q d (t)(q d (t+l)-q d (t)). With the optimal policy, q* d is strongly 
stable. By summing from 0 to T— 1, dividing by T and taking 
lim inf 7^00, we have 

1 T_1 

liminf - Y, q d {t)[R*{t)-n{P*{t))] 

1 — *oo 1 — 
t=0 

q d (T)q d (T) - q d (0)q* d (0) 



< lim inf ■ 

T- 



T 



1 

liminf- ^ <7S(i)((fe(t-l) -?*(*)) 

1 — *oo _L 



{=1 



< Am,, lim inf 



T^oo T 



1 



(37) 



which is a finite constant that is not related to V. 

By summing from 0 to T — 1, dividing by T and V, taking 
liminfx^oo over Equation (34), combined with Equation (36), 
and Equation (37), we get 

^ T-l ^ T-l 

h > inf ff 12 - li ^ ini r J2 R *w - + p) 

I KYI / * ■ / »(Y1 / ' 



t=0 



t=0 



T^oo T 

II) B d = oo ant/ B b < oo. 

From Equation (12), we have qf(t + 1) < (<ft(i) — ?7o) 2 + 
(7 0 (t)+P(t)-r(t) + M(i)) 2 + 2(g b (<)-r; 0 ) + (7 0 (t) + P(t)- 
r(i) + M(*)), then 

A = A(q d (t) , q b (t)) 

< (1 + Ppeak)' + ^ + r'ax + 2%r m ax + A^W) + B 2 ' (t) 

+ VR(t) + 2 [q d (t) - V/2] R(t) - 2[q d (t)ix{P{t))- 
q b (t)P(t)] + 2q b (t)(l 0 (t) - r(i) + M(t)) 



< (1+ Ppeak) 



\2 



2 i 2 



y(P(t) - R*(t)) + 2q d (t)[R*(t) - »(P*(t))] + 2q b (t)[ 
P*{t) - r(t) + M*(t)] + (y + 2A m . dX )(p + Mm«)io(t) 
+ 2q b {t)M{t) 

Without loss of generality, let g&(0) = 0. Similar to the 
proof of Equation (17), we have the following cases: 
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i) if P(i) > r(t), l 0 {t) = 0 and q b (t) > 0, then M{t) = 0, 
q b (t + 1) < q b (t) and q b {t + 1) - g 6 (t) < Qb{t) - q b {t + 1), 
i.e., even if q b (t) increases, the increment is no larger than the 
decrement of q b (i); 

ii) if P(t) < r(t), l 0 (t) = 0, then if q b (t + 1) > 0, q b (t) - 
q b (t+l)=r(t)-P(t)-M(t)-7 lo + (? lo -q b (t))+ > q b (t + 
1) - q b (t) = r(t) - P(t) - M(t), i.e., the decrement of q b (t) 
is no less than the increment of q b (t), else if q b (t + 1) = 0, it 
goes to case iv); 

iii) if l 0 (t) = 1, then <fa(t + 1) = min[r(£), B b ] by definition 
of discharging event Equation (10). If M(t) > 0, then this 
means r(t) > B b and -r(t) + M(t) = -B b ; 

iv) if q b (t) = 0, then q b (t)M(t) = 0 anyway. 

Combine the above discussion with Equation (16), we have 
that Vi > 0, if M{t) > 0 and q b (t) > 0, we must have 
qbit) < {3(\ + A mnx ) + max[r max , 1] - B b . Thus, 

T-l 



t=0 



< 



(/3(£ + A max ) + max[r max , 1] - B b )+r n 



V 



The remaining argument is similar to case I), and we obtain 



^ T-l ^ T-l 

liminf - R(t) > liminf - £ iT(t) - Vo(^ 



(3) 



t=o 



t=o 



o(i)-o,<i^). 

Ill) Bd < oo and B b = oo. 



V 



At(P(t)) + /(*) + R(t) - D{t))+, q d {t) decreases no slower 
and increases no faster than q d (t) until q d (t) hits zero. 

Note that only when qd(t) < Umax, I{t) may be strictly 
positive. Then combine with the above discussion, we have 

T— 1 

1 ijminf 4Z>(W) 



V T 



< 



t=0 
I - Mmax 



0( (i^r 



)■ 



Construct an auxiliary queue <?*(i) with the following 
evolution: 

g*(t + 1) = ?(t) - (VdR*(t) + e) + D*(t), 

where e > 0 can be arbitrarily small. Since for an optimal 
policy, 



T-l 



lim sup 



T-l 



< i]d lim inf 



t=0 



t=0 



1 T_1 

< lim sup - Y^(r, d R*(t) +e), 



so is strongly stable. 

Note that &(*)[£>*(*) -%i2*(t)] - + 
e]. By summing from 0 to T — 1, dividing by T and I/, and 
taking liminfr^oo, we have 



From Equation (13), we have <^(t + 1) < (qb(t) - rj 0 ) 2 + 
(l 0 (t)+P(t)-r(t)) 2 +2(q b (t)- Vo )+(l 0 (t)+P(t)-r(t)).Also 
from Equation (11), we have q 2 d {t + 1) < (<7d(i) — r/ d R(t)) 2 + 
(/(*) + P(i) -m(^W)) 2 + 2(q d (t) -VdR(t))+(I(t) + R(t) - 
fi(P(i))), then 

A = A{q d {t),q b {t)) 

< (1 + Ppeak) 2 + + r 2 ax + 2r/ 0 r max + M 2 (P(i)) + (1 + ^) 
• P 2 (t) + VR(t) + 2q b (t)(l 0 (t) - r(t)) + 2q d {t)I{t) + 2 
[(1 - Vd)Ut) - V/2]R(t) - 2[q d (t)KP(t)) - q b (t)P(t)} 



T-l 



1 liminf 1 ^ &(*)[£>* (i) - %P*(i)] 



V T 
1 



t=0 



. - liminf g ' rf(r)r(r) ~ g ' d(0)r(0) I g ' d(t) C 

- v t^oo t y 

T 

+ 1 liminf 1 - 1) - &(*)) 



t=i 



1 



<(1+Ppe ak ) 2 +^ 



Mmax 



2 

max 



< ^ A max lim mf - 2^ 9* (*) 

t=i 



V^2 + 



+ yp(t) - Fir(t) + 2%(i)[p*(i) - /x(p*(t)) + r (t)- 

D*(t)] + 2g (I (t)[D*(t) - %ir(t)] + 2%(t)[P*(t) - r(t)] 

+ 2q d (t)I(t) + (V + 2A m , x )(/3 + Mmax)io(t) 

Without loss of generality, let qd(0) = Bd- Similar to the 
proof of Equation (17), we have the following cases: 

i) if D(t) > 0, then from q d {t + 1) = (q d {t) - pt(P(t)))+ + 
R(t)-D(t) and I{t) = (fj,(P(t))-q d (t))+, I(t) can be strictly 
positive only when A(t) > Bd- However, whenever D(t) > 0, 
q d {t + l)=B d . 

ii) if D(t) = 0, from q d {t + 1) = q d {t) - n{P{t)) + I(t) + 
R{t) - D(t) and q d (t + 1) = ((q d (t) - r) d R{t))+ + D(t) - 



The remaining argument is similar to case I). Further, by 
letting e — > 0, we obtain 

. T-l . T-l 

liminf - V J2(t) > liminf - V iT(t) - »j 0 (/w + /?) 

i — >oo 7 ^ — ' T — *oo / ^ — ' 



t=0 



t=0 



1 



Pd)- 



IV) Bd < oo and B b < oo. 
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Simply by combing case II) and case III), we obtain 

T-l , T-l 



t=0 



)- 



t=0 



_0(I,- 0 (<^-*» + 



V 



)• 



C. Proof of Corollary 1 

Similar to the proof of Proposition 1, we have the fact 
if q£(t) is strongly stable, then limsupj^^ = 0. 

From Equation (26), we have qg(t + 1) > q£(t) - rfc + 
E* e n n u e n P M ~ r n (t) + M n (t) + l^(t). Note that (t + 
1) - g?W-Ei 6 n BU e„ flW+rn(*)-M n (t). By summing 
from 0 to T — 1, dividing by T and taking limsup of both 
sides, we have 



hm sup — > hm — 

T ~ T^oo T 



T-l 



T^oo 



■ lim 

T^oo 



g6 "(0)~ gb "(T) 
T 



t=o 



Since limsupy^^ %^ = 0, limsupj^^ = 0 

limsupy^^ gb = 0, and limT^oo = 0, so we get 
Po = I™ sup^ i EL'o 1 7 " (*) < , VneN. ■ 

Z). Proof of Theorem 2 

Proof of Equation (28) and Equation (31): We prove Equa- 
tion (28) by induction. Let q d ' dx (t) be the maximum data queue 
length for all flows at slot t. Assume that q™ x (t) < \ + A max 
(holds for t = 0 by letting g£' e (0) = 0, Vn, e e M), need 
to show that it holds at slot t + 1. Consider the data queue 
9d ' e (*+ 1) maintained at any node n for flow destined to any 
node e^nat slot t+ 1. If node n received data destined to e 
from other node m at slot t, then by the routing policy in Sec- 
tion V and definition of tu (m , n )(t), i- ■<#' ! (t) > 7 ( e m ,n)' 
where (m, n) is the link from node m to node n. Choose 7 
such that the resulting backlog of the receiving node is not 
longer than that of the transmitting node (let ^™ ax to be the 
maximum endogenous arrivals, then 7 = /i|J} ax + A m . dx satisfy 
this condition), we then have q d ' e (t + 1) < q d ' e (t) + 7, then 
q? e (t + 1) < C e (i) < <#"(t) < \ + A max . If node n 
did not receive any data destined to e from other nodes, then 
it can only have exogenous arrivals. Clearly q d ' e (t + 1) < 
q d ' e (t) < -j + A max if there were no exogenous arrivals. 
If there were exogenous arrivals, by MRC of Section V, we 
must have q d ' e (t) < y, then q d ' e (t + 1) < ^ + A max . Thus, 
Equation (28) holds. Equation (31) can be shown using the 
same argument. 

Proof of Equation (29) and Equation (32): Since 

W(^W) < W(0) + for Pi(t) € n,(t), Vt > 0, 

- (C (0 (*) + C C( V*W < ^(*)w(o) + 

p Wl {t)Pi{t) - (f b w(l \t) + q™ (l) )Pi(t) where P t (t) is the 
solution of Equation (27). If /3wi(t)Pi(t) - {q™ n(l] \t) + 



C ( V;(*) < 0, then we get u;,(t)w(^(*)) - + 
% c ^)Pi{t) < wi(t))j,i(0). However, solution of Equation (27) 
chooses P;(t) that maximizes wi(t)fXi(Pi(t)) — (q" n< " l \t) + 



qr {l) )Pi(t) which means «;,(*)w(^(*)) " (C^'W + 
qT {l) ) p l(t) > wi(t)m{0) since 0 e Then we must 

have - (qt m{l) (t) + > 0. Using a 

similar argument as in the proof of Equation (16), we obtain 

(«T (0 (*) + C C( °) < < /3«T + ^ax) for all 

t, and this holds for all I € L. If none of the incident links 
of a node is active, then by Equation (26), the virtual battery 
queue length does not increase anyway. Thus, Equation (29) is 
proved. Equation (32) can be shown using the same argument. 
Proof of Equation (30): Define L(q d (t) , qb(t)) = 
E„, e (?r(i)) 2 + E„(C(i)) 2 , then 

A(t) = L(q d (t + 1), f 6 (i + 1)) - L{q d {t), &(t)) 

< ]T[^max + 2A maxAimax + 2 M LJ + + ^ P eak) 2 + 

n.e n 

(%") 2 + rLc + 2%r max ] + 2 ^ gj' e (t)i^(t) - 2 ^ 

■ (? «m(0.e.(*) (t) _ gj c (').e.(t) (t))w(fl(t)) _ 

(t) + C c(0 (t))Pi (t)] + 2 2 %(t) [l n 0 (t) r n (t)+ 

n 

M n (t)} 

Note that if q «MDMt) {t) _ ficWMQ^ < 7j e«(*) > 
then toi(t) = 0 and m(Pi(t)) = 0, then 

(C (0,ei(t) W - ^(MM) = m{t)m{m)); 

otherwise, ^"'"("(t) - q ^ e ^\t) > Wl (t), then 

(C w,ei(t) w - qT (l)Mt \t))^m)) > mmm))- 

We then have 

A(t) < ^[A^ + 2A maxA i max + 2 M Lx] + E^ 1 + P P eak) 2 + 

n,e n 

«) 2 + r-Lc + 2 % r max ] + 2 £ #(t)[7»(t) - r n (t)+ 

n 

M n (t)\ + 2Y,qTm e n (t) -2Y J [m{t)m{Pi{t))- 



-tran(0/ 



Note that EieAMVMm)) - («T ( °(*) + 

C C(0 W)W] - E ie M (t) N(t)w(^W) - («T (0 (*) + 
C^W)^^)] > E ieM . w NWw(^ s W) - (C n(0 W + 

g)j ec ^^(t))P ; s (<)], where M(t) is the matching chosen by 
MWM algorithm and M*(t) is the matching picked by 
optimal policy in slot t, and Pf(t), V7 e £ are the 
suppositionally calculated power in MPA. Since the objective 
function of power allocation component is separable over 
links and for each link it is concave, using similar argument 
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as in single link case, we have where -D™' e (i) is the amount of overestimated endogenous 

A(t\<X^\A 2 +9 A u +9„ 2 l+Vm + P arrivals t0 the qU6Ue Since S; e e n Mf(^W) may be 

^ 2^^max + z/i maxAimax + ^maxJ + Z^^ 1 + ' r P eak '' + larger than the actual endogenous arrivals; D%£(t) is the 

amount of overestimated departures since £; e n A*f(-FK*)) 
may be larger than the actual departures. Further, at slot t, 



(Vo) 2 + rLx + 2?7 0 r max + (V + 2A max )(/3 + U max )7™ 



9/ (*) 



if node n does not have endogenous arrivals for flow e, then 
D™' e (t) = 0; if flow e is transferred from node n to node m, 

E_ "7 ,,, *'", n then D™'?(t) — D™' e (t). By choosing 7 = A max + jU max , the 

/ tran(;),e/,N rec((),e/,\ \ e/p*/,\\ , n out w m v / J a i max f~maA> 

/ j \1d \ T ) ~ Qd \ T > ~~ (m \t) ) + * resulting backlog of the receiving node is no longer than that 

™' e of the transmitting node. We then have 



E (*) E - r «w + <w + M «w 

n |_ieo„ u e„ 



n,e 

) 2 + (»?" ) 2 + r max + 2??o7-max + (V + 2A max )(/3 + 

• Kit)] + Vj2(K(t) K(t))+2j2%(t)Mn(t)+ 



E«rwr(t+i)-9rw) 

- E MfOTO) 



2E^ e (*) 



<w+ E-E koto) 

\iee„ iefi„/ 

E Pt{t)-r n {t) + M*{t) 
ien n u e„ 



>E «?"(*) 



(38) 

Note that tf*(t + 1) = CW - E/ e n nU e„ P r(*) + 
r„(f) — M*(t) for the optimal policy. By multiplying 
q£(t) for both sides and rearranging terms, we 

obtain qg(t) [j2 le n n u e„ P t(t) - r n (t) + M*(t) 

q£ (*)[«?*(*) - C(* + !)]■ % summing from 0 to T - 1, 

dividing by T and taking lim inf 7^00, we have 



iefi„ 

Since g£' e (i) - g£' e (i+ 1) < Anax + Mmax, Vi > 0, Vn, e e 
N and g^ e * are strongly stable Vn, e€N, then 

W ^ E E «?"(*)(«?"*(* + 1) - ?r (*)) 



^E 

n.e 



£=0 n.e 

lim <zr(T) ! r * (r) +(A max + A , max ) 



T 



?r(o)«r*(o) 



• lim inf — q\ ' e * (t) — lim 



T-l 



KrEE^') 



t=0 n 



E Pf(t)-r n (i) + M*(i) 
zeo„ u e„ 



< 



E 



(ir + A max ) lim grf 1 + (A max + li^) 
Z 1 — >oo 1 



= > lim inf 

^ T^oo 



(=i 



liminf -EE (*)(#(*) - 5T(* - 1)) 



1 T 

< E( A ™- + ^ l ^ ini r E c e *(t) 



(39) 



t=l n 



< (1 + Ppeak) ljm inf - E E 9"* (*) . 

t=l n 

which is a finite constant that is not related to V. 

The argument of case II) in the proof of Theorem 1 can be 
directly applied here, and we have 

v ^ 1 t E E mw) < E o( { " v v Bll)+ ). 

t—0 n n 

Note that the data queue dynamics can be written as 

C e (t+ 1) = q n d e (t) + K(t) + E rf(W)) E 

/if(^(*))-£>r (*)+«(*). 



which is a finite constant that is not related to V. 
Thus, we have £ lim inf i EL'q 1 £„ e + 
E iee „ Mf(^W)-E/ e n n tf(P?(t))] < 0(f)- By summing 
from 0 to T — 1, dividing by T and F, taking lim inf T ^ 00 for 
Equation (38), we get 

T-l 



eI^^ee^) 



1 



t=0 e 
T-l 



^ E tEE R nV) - + Z 3 ) 



+ 0( 



t=0 e 



) -O(-). 



which is Equation (30). 
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Proof of Equation (33): Define L(q d (t) , %{t)) = 
\ EnJirW + 5 T^M(t))\ and then 

A(t) = L(q d (t+l)^ b (t+l))-L(q d (t),^ b (t)) 

< ^[^max + (^max + T^max + /4aJ + \ ^[(1+ 

n,e n 

^peak) 2 + «) 2 + r 2 m . dx + 2 Vo r m , x +(V + 2A m , x )((3+ 

^mt)] + vj2K(t) + \ E^' e w - 



where VZ e £, / e 5" has the following evolution: 



WR f n*{t)H l f *. (40) 



n 



£ irw -»■„(*) +M*(t) 

ien„ u ©». 



+ 



E - E A*f(^(*)) j [ie»c(t)] 



Ec(*) M »(*)- 



Under the optimal policy, the amount of data destined to the 
same destination may be routed through different routes. We 
first logically decompose the flows that are differentiated by 
destination into flows that are differentiated by routes under 
this optimal policy. Let 2f denote the set of flows after the flow 
decomposition, then any flow / e J has a fixed route. Let e(/) 
denote the original flow from which flow / is decomposed, 
and Uf(l) denote the one hop upstream link of link I in the 
route of flow /. Let H* = {H 1 ^} denote the routing matrix 
under the optimal policy, where = 1 means the data of 
flow / is routed through link I. We further logically decompose 
the data queues at nodes into data queues at links under this 
optimal policy. Since the sum queue length of the decomposed 
queues equals the original queue length, we have 



£# e (*) \k® + ( E - E J Mmwiiew)] 

■ E \lC a(l)Mf \t)RL {l) (t) (C (0,e(/) W- 



q7 {l)Mf) (t))rf(p;(t)) 



[l€M(t)] 



Etr 
Id 

U 



tran(0,e(/) 



(*) 



V Rf*(t)H Uf(l) * 
d 9 ^u f (i)y F u f (i) W)V/WeM(t)] 



+ 5 



< 



EC (0 ' e(/) W (Q l d f *(t + i)-Q l d f *(t))- 
IJ 

Ef ( ' w/) W (Q u d f{l)J *(t+i)-Q u d f{l)J *(t)), 



+ 



Since -/w < q d ' e (t) - q d ,e (t + 1) < ^4 max + Mmax, Vi > 
0, Vn, e € N, in order to apply the same idea of Equation (39) 
to bound the above term, we only need to show that all the 
accumulative data queues Q 1 /*, VI e L, f e 3 are strongly 
stable. We know that all the data queues are strongly stable 
under the optimal policy queue evolution, then the queues are 
still strongly stable after flow and link decomposition. This 
further implies that the accumulative queues are strongly stable 
(note that the accumulative queues being stable does not imply 
each queue in the routes are stable), i.e., the queues with the 
following evolutions are strongly stable: 

Q 1 /* (*+i) = (qT w - dip? (*))) + + E R n m 1 / ■ 

n 

(41) 



Let (if* = liminfT^oo ^ J2t=o A and K* = 

liminfr^oo £ J2f=o E n R f n *(t) H f*- Without loss of gener- 
ality, assume fj,j > 0. We know the optimal sensing rate 
is feasible, so Xf* < fl{* . Then the queues with evolution 
Equation (41) being stable is equivalent to the normalized 
queues with the following evolutions being stable: 

<l l /*(t + l) = (q l d f *(t)-l) + + ^. (42) 

Note that l[i e M(t)] is a random variable which is indepen- 
dent of P*(t). Thus, l[i e M(t)] an d H\ (P*(t)) are independent 
and then uncorrected, so 



1 T_1 

!™m f f E ( p r( i ))^[ieM(t)] 

t=0 

j T-l j T-l 

h T minf t E W) liminf - E 7 

J — >oo 1 * — ' 1 — >oo 1 * — ' 

t=0 t=0 



[ieavt(*)] 



T-l 



= liminf ^ 

T^oo T 



[ieM(t)]- 



t=o 



In order to show that all the queues with evolution Equa- 
tion (40) are strongly stable, we only need to show all the 
normalized data queues under the MM based algorithm with 
the following queue evolutions are strongly stable: 

ti f \t + 1) = U/*(t) l[ieM W] ) + + (43) 

Equation(42) implies Equation (43) by a similar argument 
and proof techniques as in [17]. ■ 



